SOLUTION-FREE SETS FOR SUMS OF BINARY FORMS 



SEAN PRENDIVILLE 



Abstract. In this paper we obtain quantitative estimates for the asymptotic 
density of subsets of the integer lattice Z 2 which contain only trivial solutions 
to an additive equation involving binary forms. In the process we develop an 
analogue of Vinogradov's mean value theorem applicable to binary forms. 



1. Introduction 



Certain systems of linear equations have the property that, should a set of integers 
fail to deliver non-trivial solutions to the system, then the set has zero density. 
The problem of obtaining quantitative asymptotic estimates for the density of such 
sets, first addressed successfully by Roth [TU [15], is one which has seen remarkable 
advances over the last decade; spectacularly in the work of Gowers [6] on long 
arithmetic progressions, Bourgain [3j E] on progressions of length three and Green 
and Tao [7] on progressions of length four. Recently, M. L. Smith [19\ has obtained 
density estimates for sets of integers which do not contain solutions to a class of 
homogeneous equations involving kth powers. This was the first general result on 
inherently non-linear systems. In this paper we not only generalise Smith's result 
from an equation involving kth powers to one involving binary forms, but we also 
extract density estimates for subsets of the two-dimensional integer lattice. Our 
approach uses the density increment method of Roth and Gowers, together with the 
circle method. A notable feature in our application of the circle method is a novel 
analogue of Vinogradov's mean value theorem, applicable to systems of equations 
involving binary forms. Our approach to this mean value theorem makes intrinsic 
use of the structure of the shift-invariant system associated with our equation, and 
thereby improves on those estimates which can be deduced from the much more 
general work of Parsell [13] on multi-dimensional versions of Vinogradov's mean 
value theorem. 

In order to describe our conclusions, we first introduce some notation. When 
<3? £ Z[x,y] is a binary form we write & u ' v for the derivative q x uq v v &(x, y). 

Definition 1.1. Let us say the tuple c = (cr, . . . ,c s ) of non-zero integers is a non- 
singular choice of coefficients for $ if there exist binary forms $i, . . . , <&jv satisfying 

{$i, C {$ u ' v : < u + v < k} 

r ~ , (1-1) 

C span {$i,..., <f> N }, 

such that the auxiliary system of equations 




(1.2) 
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has non-singularQ real and p-adic solutions for every prime p. 

Definition 1.2. We call a 2s-tuple (xi, . . . , x s ) diagonal if there exists an afflne line 
L = a + R • b such that Xj G L for all i. 

Writing [X] for the set {1,2,..., [X\ }, the most accessible of our density results 
can now be stated. 

Theorem 1.3. Let G W,[x,y] be a binary form of degree k > 2 and let c£Z s be 
a non-singular choice of coefficients for with c\ + ■ ■ ■ + c s = . Suppose that 
s > |/c 3 log/c(l + o(l)). Then any set A C [X] 2 containing only diagonal solutions 
to the equation 

ci$(xi) + --- + c fl $(x a ) = (xj G A), (1.3) 

satisfies the bound 

\A\ ^X^loglogX)- 1 ^ 5 - 1 ), (1.4) 
where the implicit constant depends only on c and $. 

Remark 1.4. For a more precise lower bound on the number of variables required 
than s > |/c 3 log/c(l + o(l)), see Theorem 15.11 

For comparison, recent work of Smith [TB] establishes a version of the above result 
in which $ is replaced by a kth power and the set A is a subset of the integers in 
the interval [1,-X]. Indeed, our insistence that A contains only diagonal solutions 
to (|1.3p precludes the deduction of Theorem 11.31 from Smith's result. We also note 
that Smith obtains an exponent of log log N in (jl.4p of the form — 2~ 2 . 

One can obtain a qualitative version of Theorem 11.31 by applying the multidimen- 
sional Szemeredi theorem of Furstenberg and Katznelson [5]. In this way, one can 
show that any (infinite) set A C Z, 2 containing only diagonal solutions to (|1.3p must 
have zero upper Banach density. If one had a quantitative version of the multi- 
dimensional Szemeredi theorem providing bounds analogous to the one-dimensional 
bounds of Gowers [6], then one could use this result to obtain bounds of the form 
(|l,4p in Theorem ll.3l However, the exponent of log log N in these bounds would be 
intrinsically dependent on the choice of form $ and coefficients c\ , . . . , c s , whereas 
our result depends only on s. Moreover, no such two-dimensional bounds currently 
exist; the best bounds presently available are due to Shkredov [T7] and are not 
general enough for our purposes. 

To obtain Theorem II. 3\ we bound the density of sets which contain only diagonal 
solutions to the larger system of equations 

ci$ u ' v (x 1 ) + --- + c s $ u ' u (x s ) = (0<u + v<k). (1.5) 

Sets avoiding non-diagonal solutions to this larger system may have greater size 
than those avoiding non-diagonal solutions to (jl.3p . However, a key observation is 
that this larger system enjoys translation- dilation invariance, in that (xi,...,x s ) 
satisfies (|1.5p if and only if (Axi +£,..., Ax s + £) satisfies (|1.5p . whenever A / 0. 
This invariance allows us to adapt the density increment method of Roth and Gowers 

MM- 

In order to implement the density increment method it is necessary to have an 
asymptotic estimate for the number of solutions to (jl.5p with variables restricted to 
the interval [1,-X]- This we obtain through an application of the Hardy-Littlewood 



Here non-singular means the associated Jacobian has full-rank over the field in question. 
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method. In order to deal with the minor arcs, we utilise Vinogradov's method^, 
which necessitates the estimation of the number J S ^(X) of solutions (x, y) E [X] 4s 
to the system of equations 

s s 

J2* U ' V (xj)=J2* U ' V ^ (0<u + v<k). (1.6) 

j=l 3=1 

When takes the form a(bx + cy) k , such an estimate can be obtained from the 
standard Vinogradov mean value theorem, as found in [211 Chapter 5]. We must 
therefore treat the remaining case. 

Definition 1.5. We say a binary form 3> E 7*[x,y] of degree k is degenerate if it 
takes the form (ax + (3y) k for some a, /3 € C. One can check that <& is degenerate if 
and only if there exist a, b, c £ Z such that = a(bx + cy) k . 

Definition 1.6. We define the differential dimension of to be the dimension N 
of the linear span of the set of non-constant derivatives 

{$ u ' v : < u + v < k}. (1.7) 

Given a maximal linearly independent subset {F\, . . . , -F/v} of (|1.7p . we define the 
differential degree of $ to be the quantity 

K = ^degF,. (1.8) 

i 

Elementary linear algebra confirms that K is independent of our choice of F{. 

Our mean value theorem for non-degenerate binary forms is then the following. 

Theorem 1.7. Let <I> G Z[x,y] 5e a non- degenerate binary form of degree k, differ- 
ential dimension N and differential degree K. Write M = \N/2], and define 

A S = K(1-^) [S/Mi . (1.9) 

Then we have the bounds 

X is ~ K < J S ,$(X) < x 4s ~ K+A % (1.10) 
where the implicit constants depend only on s and <£. 

We remark that when $ is a degenerate binary form, then K = k(k + l)/2 and 
N = k. Hence our result is comparable to the standard Vinogradov mean value 
theorem, where one obtains 

Using very general work of Parsell |13j . one can extract a bound on the exponent 
A s in Theorem 11.71 of the form 

A s < rke 2 - 2s ' rk , 

where r = (k + 2)(k + 3)/2 — 1. By way of comparison, an immediate consequence 
of Theorem 11.71 is the bound 

A s <^e-^/av+i)J, 

and one certainly has K < rk and N < r. Moreover, Parsell's general theorem 
is obtained through the somewhat formidable method of repeated efficient differ- 
encing. We are able to extract our result from the comparatively simple p-adic 



2 See Chapter 4 of [TT] for a description of this method. 
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iterative method, originating with Linnik [10J, and reaching a refined state in work 
of Karatsuba [§] and Stechkin [20] . 

An expert in the field might hope to apply the above result via Vinogradov's 
method to obtain superior bounds for exponential sums over binary forms, at least 
when k is large. However, as demonstrated in Wooley [23, §8], one can already 
attain such bounds using the standard Vinogradov mean valued theorem. 

1.1. Notation. Throughout the remainder of the paper we fix a non-degenerate 
binary form <5 of degree k, differential dimension N and differential degree K. We 
reserve the letter M for the quantity fiV/2] . Let us also fix {F±, . . . , Fn}, a maxi- 
mal linearly independent subset of : < u + v < k}. Let F denote the tuple 
(F±, . . . , Fjv). Setting k{ = degFj, we always assume that k = k\ > &2 > • • • > k^ = 
1. Using Taylor's formula, a convenient consequence of our ordering of the F{ is that 
for any £ G 1? there exists a lower unitriangulaid matrix G GL n (Q) such that 

F(x + £) = 3 € -F(x) + F(0. (1.11) 

We call this property translation- dilation invariance, since it implies that for any 
^ £l 2 and A ^ we have the equivalence 

s s 

^(FfoO-Ffo)) =0 ^(F(Ax i + 0-F(Ay i + 0) =0. (1.12) 

3=1 3=1 

Given a real X > 1 write [X] for {1,2,..., [^J }• We use J S) $(X; m) to denote the 
number of (x, y) G [X] 4s satisfying 

s 

2(F( Xj )-F( yj )) =m. (1.13) 

3=1 

Notice that J Sj $(X;0) coincides with our definition of J S) $(X). 

We analyse both the equations (|l,5p and (|1.6[) via the exponential sum 

/(a) = /(a;X)= ^ e(a • F(x)), (1.14) 

xe[x] 2 

where e(y) = e 2my . By the orthogonality relations we have 

J s ^(X;m) = j> \f(a)\ 2s e(a-m)da, (1.15) 

where § denotes the integral over the TV-dimensional torus T N = M> N j% N . 

Throughout, we assume that X is sufficiently large in terms of s, c and F, so all 
implicit constants depend only on these parameters, unless otherwise indicated. We 
note that F depends ultimately only on <£. 

2. The Mean Value Theorem 

Before working towards upper bounds for J S $(JT), let us derive an elementary 
lower bound. By (|1.15|) . for any m we have J s ^(X;m) < J S) $(X). Notice that 
there are Of, s {X k ) values of m for which J s $(X;m) is non-zero. Summing over 
these values, we obtain 

X K J S ^{X) » X 4s . (2.1) 
The lower bound in (|1.1U|) follows. 



3 A lower triangular matrix with all diagonal entries equal to one. 
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The remainder of this section is occupied with proving the upper bound in (| 1 . 1 j) . 
We expect the majority of solutions to (|1.6p to be non-singular (in a sense to be 
defined later), whilst the remaining set of singular solutions should be relatively 
sparse. To define the appropriate notion of singularity neccesitates the discussion of 
the Jacobian associated to (|1.6[) . 

Definition 2.1. Write Jac(xi, . . . ,xm) for the iV x 2M matrix 

(if'°(xi), ^\ X] )) 1<t<N , (2.2) 
i<i<M 

and let A(xi, . . . ,xm) denote the determinant of the N x N matrix consisting of 
the first N columns of Jac(xi, . . . ,xjf). 

In order to get our version of Linnik's p-adic iterative method to work, A cannot 
be identically zero. Notice that if $ is degenerate, then A is identically zero. Our 
first lemma, Lemma [2.21 feeds into our second, Lemma [2. 31 which establishes that A 
is non-zero when and only when <3? is non-degenerate. We keep Lemma 12.21 separate 
as it proves useful later. 

Lemma 2.2. Suppose there exists 1 < I < deg such that the linear span of the set 
: u + v = 1} is one- dimensional. Then $ is degenerate. 

Proof. For any binary form $ of degree k and < / < k one can show by induction 
that 

Let Fi be the only form from F\ , . . . , Fn with degree k — I. Then for each r = 0, . . . , I 
there exists A r € Q such that $ /_r ' r = A r Fj. We must have A r ^ for some r. Let 
us suppose that r > 0, the case r < I being similar. We have 

= \~ 1 <!> l ~ r+1 ' r = A~ 1 A r _iF°' 1 . 

Letting A = A r T 1 A r _i and iterating one sees that for all < s < ki we have jpj ii ~ s ' s = 
\ki-s F o,h_ Using t hi s a nd ([22]), it follows that 

F o,k t k t , . 

~ hi 2^{r r x V 

r=0 

= {ax + f3y) ki 

for some real a and /3, with f3 ^ 0. Differentiating in y we see that 

Differentiating in x when r < I we also see that 

$l-r,r+i = Xr+l F?'° = X r+l aki{ax + Py) ki ~ l . 
Thus for each r = 0,1,... ,1, one obtains 

\ r = (a/(3)X r+1 = --- = (a/(3) l - r Xi. 
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Inputting this into 



we deduce that 



* = fl E (\ l ' r y r ^/fi) l - r Hax + fiyf- 1 



r=0 

HA, 



Therefore $ is degenerate. 



(ax + fly) . 



□ 



Lemma 2.3. If Q is a non- degenerate binary form, then the determinant A is not 
the zero polynomial. 

Proof. For each 1 < I < k, let 1(1) denote the set of indices i for which ki := deg-Fj 



{x l ,x l 



1 y,...,y l ) 



I. For each i G 1(1) there exists G such that Fj(x, y) = c/j 
Let C; denote the matrix whose rows comprise en (i G /(/))• Since the F{ are 
linearly independent, C\ has full-rank. Hence there exists an invertible matrix Bi 
such that B\C\ is a full-rank matrix in reduced row-echelon form. Define the rational 
homogeneous polynomials G\, . . . , Gn by 







\G N J 


1- 



V 



B 



1/ 











for all i. 



From our construction, we see that degGj = degFj for all i. Furthermore, if dj 
denotes the highest exponent of x occurring in Gi(x, y), then for any i,j G 1(1) with 
i < j we have > dj. Write Jac(xi, . . . , xm) for the iV x 2M matrix 

(^•°( Xi ), G^(^)) 1<l<N , (2.4) 

l<j<M 

and let A(xi, . . . ,xjk) denote the determinant of its first N columns. By linearity 
of differentiation, we have 



(B k 



Jac(xi 



,x M j 



Bk- 



k-l 



Jac(xi, . . . ,x M ). 



Since the matrix with the Bi along the diagonal is non-singular, it suffices to prove 
that A(xi, . . . ,xjk) is not the zero polynomial. 

For r in the range 1 < r < define D r (xi, . . . ,x r ) to be the determinant of the 

2r x 2r matrix occurring in the bottom-left corner of Jac(xi, . . . , xm )■ We induct on 
r to show D r is not the zero polynomial. When N = 2M this completes the proof, 
since in this case Dm = A. When N + 1 = 2M we expand along the iVth column 
of the matrix associated to A to obtain 



N 



A 



-t>M-i(xi, . . . ,x M -i)G} ,0 (x M ) +y^Pj(xi, . . . ,XM-i)G i '°(x M ) 



(2.5) 



for some polynomials P2, ■ ■ ■ ,Pn- Since F\ is the only form of degree k\ = k and 
is non-degenerate, G\(x,y) does not take the form cy kl . It follows that G 1 '°(x,y) 
is a non-zero polynomial of degree k — 1, a degree higher than that of any other 



G]'°(x,y). Since Dm-i(xi,...,xm-i) is also a non-zero polynomial, we can use 
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(|2.5p to compare the exponents of the monomials in A which feature x^f, and thereby 
deduce that A cannot be zero. 

It remains to show that D r is non-zero for each 1 < r < N/2. We begin with a 
claim. 



Claim. For 2 < i < N the polynomial 



Wi(x,y) 



n l,0 ^0,1 
W-l 

q1,0 q0,1 



G M) G o,i _ G or G i,a (2>6) 



is non-zero, of degree fcj__i+/cj — 2 and u>ii/i highest exponent of x equal to di-i+di — 1. 

Recalling that di denotes the highest exponent of x occurring in Gi(x, y), consider 
the polynomial 

di-ix di - 1 ~ 1 y kl - 1 ~ di - 1 — di^i)x dx - 1 y kx - 1 ~ dl - 1 ~ 1 

diX d% ~ x y k% ~ di [ki — di)x di y ki ~ dl ~ 1 

= (di^ih ~ di) - di(h-i - d i - 1 ))x di -i +d *- 1 y ki - 1+ki ~ di - 1 ~ di ~ l - 
If this is non-zero then, by our construction of the Gj, it has the same leading 
monomial and coefficient as Wi (when we order monomials according to the lexico- 
graphicaj^ordering on their exponents). To establish the claim it therefore remains 
to show that 

- di) - diiki..! - di-x)) + 0. (2.7) 

Suppose otherwise. Then 

di-iki = dih-x. (2.8) 
There are two cases to consider. In the first case ki = from which it follows 

that di-i = di. However, this contradicts our construction of the Gj. The only 
other possibility is that ki-i = ki + 1. In this case ki-\ and ki are co-prime, so we 
must have dj_i = ki^\ and di = ki. Our construction of the Gj therefore ensures 
that Gi-i is the only Gj of degree ki-±, since it has the highest index of any Gj of 
degree but also has highest exponent of x equal to fef_i. This forces $ to be 
degenerate, by Lemma 12.21 a contradiction which establishes the claim. 

Notice that Z?i(xi) = Wn(*-i), so D\ is a non-zero polynomial by the claim, 
giving us the basis case of our induction. Let us suppose that D r -\ is non-zero, with 
r < N/2. Inspection reveals that D r is equal to 

Y, Pij(*i, ■ ■ ■ ,x r -i)(Gj'°(x r )G;' 1 (x f .) - Gf (Xr^^X,.)), (2.9) 

N-2r<i<j<N 

where the Pij are polynomials with P$j = D r -\ when 

{j, j} = {N - 2r + 2, N - 2r + 1} . 

Let Wij denote the polynomial G^G®' 1 — G^G®' 1 . The degree of the Wij occurring 
in (|2.9p is maximised only when ki = /cjv-2r+i and kj = k^-2 r +2- In this case, the 
highest exponent of x occurring in Wij is strictly less than djy-2 r +i + aV-2r+2 — 1) 
unless i = N — 2r + 1 and j = N — 2r + 2, in which case Wij = W^-2 r +2- It follows 
from the claim and the induction hypothesis that the term 

-P/V-2r+l,iV-2r+2(xi, • • • , X r _i )Wn~2t+2 (x r ) 

has a monomial occurring in no other term of the sum (|2.9p . hence D r is itself 
non-zero. The lemma now follows. □ 



So (oi, . . . , a„) -< (6i, . . . , b„) if there exists i with di < hi and a.j — bj for all j < i. 
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The p-adic iterative method yields a congruence relation amongst the variables 
of equation (jl.6|) . In order to use this relation to provide an iterative bound on 
J S) $(X), we need to count the number of solutions to such a congruence. This is 
the purpose of the next lemma. 

Definition 2.4. Given er G {-1,1} M , m G Z N , £ G I? and a prime p, define 
Bp (m; £) to be the set of solutions (xi, . . . , xa/) modulo p fc of the system of equations 

M 

OjFi{*j -€) = m i ( mod P k ') (1 < * < (2.10) 

i=i 

satisfying the additional condition that A(xi, . . . , xm) ^ mod p. 
Lemma 2.5. VFe have the upper bound 

\B°{m-i)\<kf-k N p 2Mk - K . (2.11) 

In order to prove Lemma 12.51 we record a simple generalisation of Lagrange's 
theorem on the number of roots of a non-zero polynomial over an arbitrary field, a 
result which proves useful elsewhere. 

Lemma 2.6. Let ¥ be a field and P G F[Xi, . . . , X m ] a non-zero polynomial. Let a% 
denote the highest exponent of Xi occurring in P. If A C F is finite then 

# {x G A m : P(x) = 0} < (ai + • • ■ + a m )\A\ m -\ 

The proof is a simple induction on the number of variables m, which we leave as 
an exercise for the reader. 

Proof of Lemma \2.5[ Let T>(n) denote the number of elements (xj, . . . , Xm) in the 
set Bp (m;£) satisfying the stronger congruence 

M 

^^(xj-O^n (mod/). 

3=1 



Then 



|^(m;|)|< £ ... £ 2>(n) 

l<rti<p fe l<njv<P* 
niEmi mod p'l n^=m^ mod p fc iv 

< p( fc7V -^)maxP(n). 



(2.12) 



For each tuple (a?i, x 2 , ■ ■ ■ , X2M-i,%2m) counted by T>(n) there are at most p( 2M ~ N ) k 
choices for X{ with i > N. Fix such a choice, and define the polynomials 

M 

fi(xi, . . .,x N ) = } j ajF i (x 2 j^ 1 - £i,x 2 j -6) ~ n i (1 < * < ^0- 
i=i 

Then /i, . . . , /jv are polynomials in Z[xi, . . . , xjv] with deg fi = k{. By Theorem 1 
of Wooley [22], the number of integer tuples 1 < (xi, . . . , xn) < p k satisfying both 

f i (x 1 ,...,x N ) = mod/ (l<i<N) 

and 

det(^-(x)) ^0 mod p 

V OXj I i,j 
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is at most (deg/i) • • • (deg/j\r) = ki---k^. One can check using that for 

(u, v) = (1,0) or (0,1), we have 

l>i 



Hence it follows that 
det( 9f ' 



. v Xl, ...,x N 
\OXj / 1,3 



|A(xi,2 2 , • • • ,X2M-1,X 2 m)\- 



So there are at most k\ ■ ■ ■ k^ choices for (x\, ■ ■ ■ , xn) with (xi, . . . , X2m) counted 
by V(n). Thus 

V{n) <k 1 ---k NP ^ M ~ N '> k . (2.13) 
Putting ()2.12p and ()2.13|) together, we obtain the lemma. □ 

Lemma 12.51 allows us to count non-singular solutions, which we have still to de- 
fine. The remaining singular solutions are counted by the following lemma. First a 
definition. 

Definition 2.7. Define St(X) to be the set of 

(xi,...,x t ) € [X] 2t 

such that for any function h : [M] — >■ [t] we have the identity 

A( x h(i)> • • • ,*h(M)) = 0- 

Lemma 2.8. Let $ be a non- degenerate binary form of degree k and differential 
dimension N . Setting M = \N/2\, we have the upper bound 

\S t {X)\ <Mt M (2k) t X t+M - 1 . (2.14) 

Proof. The result follows trivially if t < M, so we may assume that t > M. Let 
us define a sequence of non-zero polynomials Aj(xi, . . . , Xj) for i = 0, 1, . . . , M. We 
begin by setting Am = A. Suppose we have constructed Aj with i > 1. Let us 
write x a for the monomial x^x^ . Of the monomials x^ 1 • • • x^' occurring in Aj, let 
bj denote the maximum in the lexicographical ordering over all aj. It follows that 
there exist polynomials Aj_i(xi, . . . , Xj_i) and Ri(xi, . . . ,Xj) such that 

Aj(xi, . . . ,Xj) = Aj_i(xi, . . . ,Xj_i)x^ + i?i(xi, . . . ,Xj). (2.15) 

Moreover, we may assume Aj_i is non-zero and that every monomial x^ 1 • • • x^ 1 
occurring in Ri satisfies aj -< bj, where -< denotes the (strict) lexicographical or- 
dering. For consistency, let us set Ao = 1 and R\ = 0. For each i in the range 
1 < % < M, define % to be the set of (xi, . . . ,x^) G [X} 2t satisfying both of the 
following conditions: 

(i) For any h : [i] —¥ [t] we have 

A«( x /i(i)> • • • ,x/j(j)) = 0, 

(ii) For each j < i there exists h : [j] — > [t] such that 

Aj(x h(1) , . . .,x-h(j)) / 0- 
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Then we have that 

S t (X)c (J % 

l<i<M 

Let (xi, . . . , xjv/ ) G %. Then there exists some h : [i — 1] — > [t] such that 

A«-i( x /i(i) 5 • • • >Xh(i-i)) ^ 0, 
yet for all j £ {h(l), ... ,h(i— 1)}, the identity fj2. 15j) tells us that 

= Ai_i(x ft(1 ),. . . ,x /l(i _ 1) )x^ +i? J -(x /l(1 ), . . ^x^.^Xj)- (2.16) 

Since the two-variable polynomial 

Q(X) = A i _i(x /l ( 1 ), . . . ,x /l ( i _ 1 ))X bj + Rj(x h ^),. . . ,x.h(i-i),X) 

is non-zero, it follows from Lemma 12.61 that for each j, the number of Xj satisfying 
(|2.16|) is at most 

(bji + b j2 )X < 2kX. 

There are trivially at most X 2 ^ 1 ^ choices for (x^m, . . . j'X-hU-i))} an d a ^ most 
choices for h : [i — 1] — ¥ [t] . Thus 

\Ti\ < t i -\2k) t - l+1 X 2 ^ +{t - i+l ^ =t M (2k) t X t+i - 1 . 

Hence 

\S t (X)\ < Mt M (2k) t X t+M ~ 1 . 

□ 

We can now implement the results obtained so far in this section to prove the 
following lemma, which encodes the basic iterative relation underlying our p-adic 
approach to bounding J Sj $. Again, we begin with a definition. 

Definition 2.9. Given a prime number p, £ G Z 2 and a G {— 1,1} , define the 
exponential sums 



f p (a;0= E e ( Q ' F ( x ))> 



xe[x] 2 

x=£ mod p 



M 



( Xl ,... )XM )6[X] 2A? 3=1 
A(xi,...,xj\,/)^0 mod p 

Lemma 2.10. Let s > M. T/ien i/iere exists £ G Z 2 , er G { — 1, 1} M and a prime p 
in the range X x l k <p< 2X l / k such that 

JsMX) « X 2s+M - X + p^~ M ) j |^(a)| 2 |f p (a;0| 2(s - M) da. (2.17) 

Proof. If (xi, . . . ,xm) G [X] , then each ij-entry of the matrix Jac(xi, . . . ,xjvf), 
defined in (j2.2|) . is of order 0{X ki ~ l ). Hence there exists a constant C = C($) such 
that for any (xi, . . . , x^r) G [X] 2M we have 

|A( Xl ,...,x M )| <CX K ~ N . 
By the prime number theorem, for all X 3><j, 1 we have 

vr(2X 1 / fc ) - n{X l ' k ) > H ° gC + k(K - N). (2.18) 

log X 
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Let T be the smallest positive integer bounded below by the right-hand side of (|2.18|) . 
and let V denote the set of the T smallest primes in the interval X l / k <p< 2X 1 / k . 
Then 

l[p>x T / k >cx K - N . 

In particular, for each (xi, . . . ,xm) 6 [X] 2M with A(xi, . . . , x^f) ^ 0, there must 
exist p € V such that 

A(xi, . . . ,x M ) ^ (mod p). 
Now J Sj $(X) counts tuples (xi, . . . ,X2 S ) G [X] 4s which satisfy 

s 

^(F(x 2i _i) - F(x 2 i)) = 0. (2.19) 

i=l 

Let T denote the number of such tuples which are not contained in S2 S (X). Then 
by Lemma 12.81 we have 

If (xi, . . . , X2 S ) denotes a tuple counted by T, then it satisfies ()2.19p and there exists 
h : [M] — > [2s] such that A(x^m, . . . , 'X-hiM)) 7^ 0- Hence for each such choice of 
tuple and function h, there exists a prime p &V such that 

A(x ft(1 ),...,x ft ( M ))^0 (mod p). 

Notice that by the definition of the determinant A, such a choice of h must be 
injective. Since there are 0(1) choices for h, and 0(1) choices for a prime p € V, we 
see that there exists p and a E {—1, 1} such that 



T« j\r p (cx)\\f(cx)\ 2s - M &ot 



< ( & |^(a)| 2 |/(a)| 2 ( s - M )d«V /2 J^(X)^. 



Thus 

J., 9 (X) « X 2 ^- 1 + I \$°( a )\ 2 \f(a)\ 2 ( s - M ^da. (2.20) 
By the triangle inequality 

| /(Q) |2(-M) = | £ fp(a;0 | 2(S - M) < p 4( S -M) |fp(a;0|2 ( S -M)_ 

Incorporating this into (|2.20p . we obtain the lemma. □ 

The following lemma eventually allows us to conclude that the first term on the 
right-hand side of (|2.17p is smaller than our hoped for upper bound. 

Lemma 2.11. Let $ be a non- degenerate binary form of degree k, differential di- 
mension N and differential degree K . Set M = \N/2 \ . Then we have the inequality 

K/k<M + \. (2.21) 

Proof. We establish this result in a series of claims. Let Ni denote the size of the 
set {i : ki = I}. 

Claim 1. Let 2 < I < k. Then iV/_i > Ni or N t = I + 1. 



12 
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Let Gi, . . . ,G m denote a basis of forms for the space spanjFj : ki = I}, so that 
m = Ni. Let dj denote the degree of the one variable polynomial Gj(x, 1). By 
performing a linear transformation we may assume that d\ > d% > • • • > d m . 

Suppose there exists i for which di < I + 1 — i. Let i denote the minimal such 
index. Then each of the one variable polynomials Gj (x, 1) with 1 < j < i has 

degree dj — 1, whilst each of the polynomials G^' (x, 1) with j > i has degree dj. 
Since 

di-l>d 2 -l>---> di-i - 1 = I + 1 - i 

> di > ■ ■ ■ > d m , 

the polynomials G\'°, . . . , G^, G®' 1 , . . . , Gm are a linearly independent subset of 
the space span{Fj : fej = I — 1}. It follows that iV^ > iVj. 

Next suppose that for all i we have = Z + i — 1. If Ni < I + 1 then, as above, 
G\' , . . . , Gm° form a linearly independent subset of span {Fi : ki = I — 1} of size JVj 
and we are done. The only remaining possibility is that Ni = I + 1. This establishes 
Claim 1. 

Claim 2. For 2 < / < k we have the inequality 

Ni-i <Ni + l. 

Let Gi, . . . ,G m denote a basis for span{Fj : ki = I}. Then for each u,v with 
I = k — u — v, the form & u ' v is a linear combination of Gi, . . . , G m . It follows that 
each Q u+1 > v is a linear combination of G\' , . . . , Gm°- If u > then $ n ' 1 '+ 1 is also a 
linear combination of the Gi' , since (u, v + 1) = {v! + 1, f') for some u', u ' > and 
I = d — u' — v' . Thus 

spanjF, : ^ = / - 1} = span . . . , $M-J+i\ 

Clearly this latter space has dimension at most iVj + 1, which is what we require. 
Claim 3. If 1 < I < k/2 then N k ^i < Ni. 

Let 1 < I < k/2. If Ni < i + 1 for all I < i < k — I then by Claim 1 we have 

N k -i <---<N t+1 < Ni. 

Next suppose iVj = i+1 for some I < i < k — l. Since span{Fj : fcj = i} is a subspace 
of the (i + l)-dimensional space 

span \x l ~ 3 y 3 : < j < i\ , 

these spaces must in fact coincide. Taking derivatives, we see that span {Fj : kj = 1} 
coincides with span {x z_J V : < j < Z}, so JVj = Z + 1. By Claim 2, we have 

A^-z < + 1 < • • • < iV fc + Z < 1 + Z = iV,, 

the last inequality being a consequence of = 1. This establishes Claim 3. 

Finally, we use Claim 3 to prove the lemma. Observing that \ + M > (N + l)/2, 
the inequality (|2.2ip therefore reduces to showing that 

i=i i=i 



SOLUTION-FREE SETS 13 

Re-arranging, we need only show 

E (z-!)^<§+ E S-0"i- 

k/2<l<k l<Kk/2 

Changing variables from Z to — Z on the left hand side leaves us the task of proving 

lN k + E (t-0^-/<t+ E (1-0^- 

\<l<k/2 l<l<k/2 

This last inequality follows from Lemma 12.21 and the fact that iV& = 1. □ 

The final lemma proved before we deduce Theorem 11.71 bounds the second term 
on the right-hand side of (|2.17p . 

Lemma 2.12. Suppose that s > M and X 1 ^ < p < X. Then 

y"|^(a)| 2 |f P («;^)| 2 ( s - A/ )da<<X 2A V Mfc - X ^-A/,<i.(2X/p). (2.22) 

Proof. The left-hand side of (|2.22p counts tuples (xi,yi, . . . , x s ,y s ) G [X] 4s satisfy- 
ing the Diophantine equations 

M s 

^a,(F(x J )-F(y J ))= £ (F( Xj ) - F(y,)) , (2.23) 

3=1 j=M+l 

under the additional constraints that both A(xi, . . . , xm) and A(yi, . . . , y^f) are 
non-zero modulo p, and for all j > M we have Xj = yj = £ (mod p). Translation- 
invariance (j!.12[) and homogeneity together imply that 

M 

<Tj (Fi(xj - - Fifyj - 0) = mod p k * (l<i<N). 

3=1 

Fix a choice of (yi, . . . , y^) € [X] 2M and set 

M 

3=1 

Let x denote the residue class of x G Z modulo Then (xi, . . . ,%) G £>p (m; £). 
Since p fe > X, the map 

(xi, . . . ,x M ) (xi, • • • ,xm) 

is injective when restricted to [X] 2M . Hence, there are at most \B" (m;£)| choices 
for (xi, . . . ,xjf). Set 

M 

n = E °o (^( x 3 - - - i)) ■ 

3=1 

Then for each fixed choice of (xi,yi, . . . , xj^,yjy), the number of choices for the 
remaining Xj,yj (j > M) is at most 

y |f p (a; 0| 2 (*- M )e(-a • n)da < ^ |f p (a; 0| 2(s " M) da. 

Employing Lemma 12.51 it remains to establish that 

j |f p (a;^)| 2 *da « 4 J t (2X/p). (2.24) 
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We may assume £ £ [p] 2 . The integral in (|2.24p then counts the number of solutions 
to the system 

t 

^(F(^+px i )-F(^+py i )) = 0, Xi , yi G 

8=1 

By (|1 . 12[) and homogeneity, this equals the number of solutions of the system 
t 

£(F(xO - F(y<)) = 0, Xi , yi G 
i=l 

Since we may assume X > p, the result follows. □ 

To conclude this section, we prove our mean value theorem. 

Proof of the Theorem ]!.^ We proceed by induction on [_ S /-^J — 0- The basis case 
is equivalent to J S ^(X) <C X 4s , which is trivial. 

Let us suppose that L S /-^J — 1- Combining Lemma 12.101 and Lemma 12.121 we 
see that there exists a prime p in the interval (X 1 ^, 2X x ' k ] such that 

J S MX) « X 2s+M ^ + p^- M ^ K X AM J s _ M M2X/p). (2.25) 
Combining this with our induction hypothesis implies that 

J. t *{X) < x 2 **"- 1 + x 4s ~ K+A °. 

It therefore remains to show that 2s + M — 1 < 4s — K + A s , which we also prove 
by induction on [ S /-^J — 1- The basis case follows directly from the estimate 
K/k < M + \ of Lemma I2TTT1 Let us suppose \sjM\ > 2, then by induction 
2s + M - 1, being equal to 2M + 2(s - M) + M - 1, is at most 

2M + 4(s — M) — K + K(l - I) LxrJ " 1 = 4s - K + A s + f (1 - ±) LwJ - 1 - 2M 

< 4s - K + A, + f - 2M. 

Since if = J2iLi h < -Wfc and ^ < 2M, we have § - 2M < 0, which completes the 
proof. □ 






X 





' p 




' p 



x-L 



+ 1 



v-6 



+ 1 



3. Weyl-type estimates 

Definition 3.1. Let us say A = A s is an admissible exponent for (s,$) if there 

exists a constant C = C(s, <J>) such that for any X > 1 we have the bound J S; $(X) < 
QY&s—K+A 

The aim of this section is to prove the following Weyl-type estimate. 

Theorem 3.2. Let A be an admissible exponent for (s, <£) and let a < g~?^ • T/iere 
/or any e > i/iere exists a constant C = C(s,$,e) such that if X > C and 

\f{oc;X)\>X 2 -% (3.1) 

then there exists integers g, oi, . . . , ajv such that 1 < q < X ka+£ and \qctj — a,j\ < 
-gka+e-kj j or a lli<j<N. 
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The proof of Theorem 13.21 uses Vinogradov's method, a general heuristic for which 
is neatly described in [51 §8.5]. We model our argument on a version of the method 
due to Vaughan [2U Chapter 5] , with a later improvement due to Baker [2J Chapter 
4]- 

Let 7y (£) denote the r/-entry of the matrix occurring in (jl.lip . Then for any 
y we have the identity 

F f (x + y) - Fi(y) = J>i(y)*>(x) = F f (x) + (y)^(x). 
Set _^ 

7i(y) = 7i(y;«) = E ar ^'( y )' 

then for all x we have 

AT 

a-(F(x + y)-F(x)-F(y)) =^ 7j (y)^(x). (3.2) 

i=2 

In the following result, and the remainder of the paper, we use ||/3|| to denote the 
smallest distance from (3 to an integer. 

Lemma 3.3. Let A denote an admissible exponent for (s,<3?) and let S be a subset 
of [X] 2 of size S such that for any distinct y, z € S there exists 2 < j < N with 

ll7i(y)-7iMII>*-*. (3-3) 

Then one has 

|/(a;X)| « X 2 log(2X) 2 {X A /S) 1/(2s) . (3.4) 
Proof. Averaging, we see that f(ot) is equal to 

^E E e(a-F(x + y))l [Jt]3 _ y (x). (3.5) 
ye-Sxe(-X,X) 2 

By orthogonality 

l[A]»-y(x) = /e(/3-x) ^ e(-/3-n)d/3. 

Since J2i- y < n <x-y e (~@ n ) — min jx, the sum /(a) is at most 

^ e(a-F(x + y)+/3-x)|min{x,||/3 1 |r 1 }min{x,||/3 2 |r 1 }d/3 

ye<s 
xe(-x,x) 2 

l0g(2X) 2 

< g sup> | g (y,/3)|, 
5 " y^ 

where 

g(y,P)= e(a-F(x + y) + /3-x). 

x6(-X,X)2 

It follows from Holder's inequality that there exists j3 £ T 2 such that 

|/(a)| 2 ^«^^j:| 5 (y,/3)| 2s . (3.6) 
yes 



16 SEAN PRENDIVILLE 

Let C = C(s, $) be any constant such that for all 1 < i < N we have 

s 

\Y i F i ( Xj )\<cx k * (^e(-x,x) 2 ). 

Define F'(x) = (F i (x)) 2 < J < iV , TV = \[ 2<i<N [-CX k \CX k ^} and 



a(n)= Yl H«-E F ( x i) + ^"E^ 
Then by ()3.2p we have 



xi,...,x s e(-x,x) 2 i=i i=i 

i=E?=iF'(xj) 



]T e(«-F(x + y)+ / 3-x)| 2S 

xe(-x,x) 2 

| £ e(a-^F(x i ) + /3.^x J )e( 7 (y)-E F/ (^) 

xi,...,x a G(-X,X) 2 3=1 3=1 3=1 

I \ ^ I 2 

= 1 2^ a ( n ) e (7(y) ■ n)| . 

By (j3.3j) and the multi-dimensional version of the large sieve inequality (see for 
example Vaughan |21|, Lemma 5.3]), we have 

^| J>(n)e( 7 (y) -n)| 2 « s ,* £ |a(n)| 2 (3.7) 

yeS nGAf neAf 2<i<JV 

Let ei denote the first standard basis vector. Recalling that J Si $(X; m) denotes the 
number of (x, y) £ [X] 4s satisfying 

s 

E(F(x<)-F(y i ))=m, 

i=l 

we have, by translation invariance, that 

El«( n )| 2 < E J s ,$(2X + l;mei), (3.8) 
A" m 

where the summation over m ranges over a set of size CX kl . If A is an admissible 
exponent for (s, then we deduce that the right-hand side of (|3.8p is of order 
0(X 4s "(^- fc i)+ A ). Putting these facts together with (|3?7|) . we obtain 



nv~\|2« ^ log(2X) 4s 4s+A 
\}{a)\ < s ,$ X . 



□ 



Remark 3.4. In the proof of Lemma [3.31 we used J S ^(X) to bound the number 
$(-X") of solutions (x, y) € [X] 4s to the smaller system of equations 

s 

XX^te) - ^(yO) =0 (« + « > 1). 

i=l 

We note that the methods of ^2] translate almost verbatim to yield the bound 
J' S ^(X) < x 4s -(^- fe )+ A , where for s = I \(N - l)/2] we have 

A<(K-fc)(l-^/. 
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This A is clearly superior to that obtained using J S ^(X). However, at the level of 
detail we are concerned with, this makes little difference to our final results, and 
increases the expositional complexity of ^5] considerably. 



In order to obtain a set S satisfying the spacing condition (|3.3p . we relate this 
condition to the Diophantine approximation of our original coefficients oti. This 
is the content of the following lemma. Unfortunately, the most direct approach 
allows us to control the spacing of the 7j(y) only according to the Diophantine 
approximation of a proper subset of the cej. We first define this subset. 

Definition 3.5. We assume throughout that denote sets of indices whose 

union equals {r G [N] : k r > 2} and such that if {i,j} = {1,2} then both of the 
following conditions hold 

span{F r (xi,x 2 ) : r G IJ D Q[xj] = {0} , (3.9) 

{F r (x 1 ,x 2 ):r^I i }cQ[x j ]. (3.10) 

Making a linear transformation of the F{ if necessary, we can always guarantee the 
existence of such I\ and I 2 - 

Lemma 3.6. Let m G {1,2}. There exists an absolute constant C = C(#) and a 
positive integer L < C such that for any y,z G [X] and a G T^, if 

hj(ye m ) - 7i(*em)|| < X~ k i for all 2<j<N, (3.11) 

then 

\\Lai(y-z)\\ < CX 1 -^ for alii el m - (3.12) 

Proof. Let us suppose that m = 1, the case m = 2 being similar. By Taylor's 
formula, we have 

fcj 1 rp 

Ft(x + yei) = Fi(x) + E '°(x) + Fi(yei). (3.13) 

r=l 



Since . . . , Fjy are a spanning subset of : < u + 1> < fc}, there must exist 

r < ki we have 

*T ,0 = E ( 3 - 14 ) 



rationals A^'° such that for 1 < r < h we have 



Combining (|3~13]l . (gJH) and ([321), we obtain 
7j(?/ e i) = 7j(yei;a) = ^ 



r=l 




E2/ (r) 

r=l 

Notice that for r > 2 we have 



E ^ = ^ ,0 = E ^r 1 * = E aSX 1 ^ 
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Hence, by linear independence, for all i,j and r > 2 we have A[j° = Y^tKt^ °\f- 
It follows that 

r>2 * \ t / 

k t =kj+r—l ki=k t +l 

and so jj(yei) — jj(zei) must equal 

C \ (1) . {y r ~ l j h ^ r ~ 1 ) ^ xr-1,0/ \ (1) 

(y-^)T,- + 2^ h Z> A *? (y- z )^t • ( 3 - 16 ) 

2<r<k-kj * t 

k t =kj+r—l 

Let Li = Li(<&) be a positive integer such that ^lAy is an integer for all i,j 
and r. It follows almost immediately from the identity (|3.16|) and induction on the 
difference k — kj, that there exists a constant C\ = Ci(3>) such that if (|3.1ip holds 
with m = 1, then for any 2 < j < N we have 



^ifcbf^y-z) <C X X-^. (3.17) 

Define the linear map 



ki ~ d ieh 

ki=d 

so that 

v J kj=d-\ ki=d 

We claim that each ^ is non-singular. To this end suppose that (3Ad = 0. Then a 
little manipulation shows that 



EW"= £ (£aa;; )f,- = o. 

k;=d 



It follows that 

E PiF i (x 1 ,x 2 ) € 



ki=d 



This contradicts Definition 13.51 unless /3 = 0. Hence for each d there exists a 
rational matrix such that A^B^ = (I | 0), where I denotes the identity matrix. 
We therefore have that 

„(i) 



Let L2 € N be such that all the matrices L^B^ (2 < k < k) have only integer entries. 
Then by (f3TT8j) and (|3"Tf|) . for all i e Ii we have 

Taking L = L2L1&., we obtain the lemma. □ 



The next lemma combines Lemma 13.31 and Lemma 13.61 a combination we record 
since we use it repeatedly in the proof of Theorem 13.21 
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Lemma 3.7. Fix m £ {1,2} and let C and L be as in Lemma \3. 61 Suppose there 
exists a real D > 1 such that for any y € [X], there are at most D elements z £ [X] 
satisfying 

\\Loii{y- z)\\<CX l - k > for alii £ I m . (3.19) 

Then we have 

\f(a;X)\^X 2 log(2X) 2 (X A D/X) 1/{2s) . (3.20) 

Proof. By Lemma [3.31 it remains to prove that our assumptions imply the existence 
of a set S C [X] of size |5| » XD^ 1 such that for any y, z £ S with y ^ z there 
exists 2 < j < N with 

||7j(ye m )- 7 j(*e m )|| >X~^. (3.21) 

By Lemma 13.61 the spacing (|3,2ip holds for some j if there exists i £ I m such that 
||Laj(y — 2)|| > CX 1_fci . Define G to be the graph on vertex set [X] with y adjacent 
to z if and only if \\Lai(y — z)\\ < CX l ~ ki for all i £ I m . This graph has maximal 
degree at most -D — 1, so (by the greedy algorithm) contains an independent set of 
vertices S of size at least \_X\ /D (as required). □ 

In order to use Lemma 13.71 to relate the size of f(ct) to the simultaneous Dio- 
phantine approximation of all the on, including those with fcj = 1, we must utilise 
major arc information. This necessitates the discussion of the standard major arc 
auxiliary approximation to f(ot). 



Definition 3.8. Define 

S(q, a) = V e (q^a • F(z)) , J(/3; X) = f e((3- F( 7 )) d 7 



ze[q] 2 

and 

V(at; q, a) = q~ 2 S(q, a)L(a — a/q; X). 

The following three results, which bound S(q,a), I(/3;X) and the difference 
f(ot) — V(ct; q, a), prove useful both in this section and the next. 

Lemma 3.9. Let q £ N and a £ 7j N . Then for any e > we have 

S(g,a)« £ (g,a)V^ +£ . (3.22) 

Proof. Letting q' = (q,a)~ 1 q and a' = (q,a)~ 1 a, we have S(q, a) = (q, a) 2 S(q', a'). 
Hence it suffices to assume that (q,a) = 1. Sorting the expression a • F into mono- 
mials and using the linear independence of the forms Fi, we see that there exists an 
integer matrix B with full row-rank such that 

a-F(x)= £ (aBjiX 1 

0<ii+t 2 <fc 

Set d = (aB, q), q' = d~ l q and b' = d^iaB). Then (q' , b') = 1, so by [1 :S Lemma 8, 
p. 54], we have 



S(q,a) = d 2 £ e (i £ 6'^) 

xe[g'] 2 V 0<n+i 2 <fc 7 

« E d 2 (g') 2 ' 1/fc+£ 
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It therefore remains to show that d <C$ 1. Since B has full row-rank, there exists a 
rational matrix B 1 with 

BB'=(l\0\, (3.23) 

where I is the identity matrix. Clearly there exists a positive integer m = 0$(1) 
such that mB' has only integer entries. Hence we have 

(max, . . . , Trio 7v ) 0, . . . , 0) = (aB)(mB ) = mod d. 

So d divides moj for all i. Since d\q and (q, a) = 1, we have d|m. Thus d <C$ 1. □ 

Lemma 3.10. For any e > i/ie auxiliary function I((3,X) satisfies 

I((3;X) ^ £ X 2 (l + X kl \(3 1 \ + ---+X kN \(3 N \Y" +£ (3.24) 



Proof. Let -B be the matrix in the proof of Lemma 13.91 Changing variables in the 
integral X) gives 

mx) = x 2 f 1 f\( V x*^03S), yW (3-25) 

■ /o 70 V 0<n+* 2 <fc ' 

Let /?i(X) = X fc »/?i, and Qi = (/3(X) • B), = X il+i2 (/373) ; . Then we can apply [1, 
Lemma 2, p. 50] to the double integral in (|3.25p to obtain 

/(/3;X)« £ X 2 min{l, \<x\^ k+E }, (3.26) 

where {ct^ = maxi \a{\. Using (|3.23f) . we have ^(X)!^ <C$ The result now 

follows. □ 

Lemma 3.11. Let q be a positive integer. Then for any a € Z n and a € T" 

f(a)-V(a;q, a ) «X + £ |?Oj - Oi|X*^ . (3.27) 

Proof. Write a = a/q + (3. Sorting the sum f(cx) into a sum over congruence classes 
modulo q, we have 



/(«) = e ( a • F ( r )/^) E E e (0 • + r )) • ( 3 - 28 ) 

By the mean value inequality we have 

e (/3 • F(gy + r)) - <T 2 / / e (0 ■ F( 7 )) d 7 « E k/Sil*** -1 . 

Summing over r and y shows that f(ot) equals 

rX(n) r X(r 2 ) ( N \ 

q- 2 V e(a-F(r)/g) / / e(/3.F( 7 ))d 7 +0 lV|gft|lM, (3.29) 

reM 2 Jo Jo V i=l / 



where X(r) = q y + 1J . Using the fact that \X — X(r)\ < q for all r S [q], we 

see that (|3.29p is equal to 

q- 2 V e(a-F(r)/g) / / e (/3 ■ F( 7 )) d 7 + O I Xq + X V I^X** I , 
reM 2 ■ / ° 70 V i=i / 

as required. □ 
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With these bounds in hand, we are able to prove the theorem advertised at the 
start of this section. 

Proof of the Theorem \3.S\ . Let r = 2sa + A s + s\ , with E\ sufficiently small (to be 
determined later). The result is vacuous if a < 0, so we may assume that A s < 1/3. 
It then follows that r < 1. By Dirichlet's principle, for each i with k{ > 2 we can 
find co-prime integers b{, qi with 1 < % < X ki ~ T and 

\ch - bi/ qi \ < qr l X T ~ k \ (3.30) 

Let C\ be the absolute constant in Lemma [3 .71 Using (|3.30p . notice that if y, z E [X] 
satisfy ()3.19p , then we have 

\\L(y - z)h/ qi \\ < dX 1 ^ + LX l+T - k >qr\ (3.31) 

For each choice y E [X], the number of residue classes modulo qi containing some 
z E [X] satisfying (pOTT) is at most CiX 1 "^ + LX l+T - k - + 1. Let D denote the 
maximum, over all y E [X], for the number of choices for z E [X] satisfying (|3,19p . 
Then we have 

D < (C 1 X 1 - k ^q l + LX 1+T -^ + l){q^LX + 1) 
<C qiX 1 '^ + XqT 1 + 1. 
Using Lemma 13.71 we see that 

|/(a)| « X 2 log(2X) 2 (X A (^X-^ + g ri + X^^ . 

By the lower bound (|3.ip . we have 

x -2sa-A s < + ^r 1 + X- 1 ) log(2X) 4s 

«(X- + gri)log(2X) 4s . 
Since r > 2scr + A s , we must have 

Qi < X 2s,7+A Mog(2X) 4s . (3.32) 

Since 1 > 2so~ + A s , this implies that the right-hand side of (|3.3ip is strictly less 
than q^ 1 (provided X ^$ 1). It follows that (|3.3ip implies qi\L(y — z)bi, which in 
turn implies qi\L(y — z), since (qi,bi) = 1. Hence it follows from the assumption 
(|3,19P that qi\L(y — z) for all i E I m . Let Q m denote the lowest common multiple of 
the set {qi : i E I m }. Then the number D satisfies D <C XQ^ 1 + 1- Utilising Lemma 
13.71 again, we obtain the bound 

Q m « X 2sa+A ° log(2X) 4s . (3.33) 

Let Q = [Qi,Q 2 ], so that Q « x 4sa+2A ° log(2X) 8s by (ET33]l . Since a is strictly less 
than 1 ) T 3A j s , we can (on taking e\ sufficiently small) find a real fi satisfying 

a < n < \ (1 - (4s + - 2A - r) . (3.34) 



As the space of linear binary homogeneous polynomials has dimension 2, there are 
at most two indices i with ki = 1. We can therefore use Dirichlet's principle to 
find a positive integer 1 < t < X 2/i , along with cij E Z (fej = 1) which are together 
co-prime to t and such that 

\t(Qai) - ai| < (fcj = 1). (3.35) 
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Set q = tQ. For i with ki > 2, let us define a, = (q/qijh. Then the iV-tuple 
a = (ai, . . . j ojv) satisfies (g, a) = 1 and 

I _ \^S X ^ (ki = l), ,„„ fi . 

It thus follows from Lemma 13.111 and (|3,34p that 

\f(a) - V(a; q, a)| < X 2 ^ + x 1+2 ^+ 4sCT + 2As + T log(2X) 4s 



Hence by the lower bound (|3.ip . we have |V(o;;g, a)| 3> X 2 a . Combining this, 
together with Lemma 13.91 and Lemma 13.101 , we see that for any e > we have 



N 



q + ^2\qai-ai\X k > « 



i=l 

Taking X sufficiently large (in terms of s, e and $), we obtain the theorem. □ 

4. The Asymptotic Formula 

In order to prove our density result, Theorem[L3j we need to estimate the number 
of solutions to (jl.5p when the variables Xj are restricted to the box [X] 2 . 

Definition 4.1. Given a finite set i C Z 2 , write R c $(A) for the number of tuples 
(xi, . . . , x s ) in the set A s satisfying 

d* u '*(xi) + --- + a,$ tt ' ,; (x a ) = (0<u + v<k). (4.1) 
When A = [X] 2 , we simply write R C ^(X). 

The Hardy-Littlewood method gives an asymptotic for R C) $(X), an asymptotic 
whose main term is a product of local densities, which we now define. 

Definition 4.2. Let <f> denote a binary form of degree k, differential dimension N 
and differential degree K. Let {F\, . . . , F^} denote a maximal linearly independent 
subset of {$ u > v : < u + v < k}. When T > 0, define X T (y) = Tmax{0, 1 - T\y\} 
and 

„ s s 

li T = Mt(c) = / x t(^2 c i F i(Ti)) • • • x t(^2 C i- F ^(7i))d7 

• 7[0 ' 1]2S 3=1 3=1 

The limit doc = o"oo(c) = Huit^-oo fj,T, when it exists, is called the real density. Given 
a natural number q, we write 

s 

M{q) = M(g;c) = #{x € (Z/qZ) 2s : ^ Cj F( Xi ) = (mod g)}. 

For each prime p, the limit 

aJc) = lim p- H ( 2s - N ^M(p H ), (4.2) 

provided it exists, is called the p-adic density. 

The purpose of this section is to prove the following asymptotic formula. 
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Theorem 4.3. Let <3? be a non- degenerate binary form of degree k, differential di- 
mension N and differential degree K . Suppose that 

s > kN (log K + log log if + 26). (4.3) 

Then there exists 5 > such that for any choice of non-zero integers c\ , . . . , c s we 
have 

RcM X ) = Voo(Hcr p )x 2s - K + 0(X 2s ~ K - s ) (4.4) 

V 

Suppose in addition that c is a non-singular choice of coefficients for <I>. Then 

o-oo ]J a P > °- ( 4 - 5 ) 

V 

Remark 4.4. The O(l) constant in (|4.3[) can certainly be lowered from 26 if one is 
willing to implement the results of Jj3] more optimally. 

The proof of Theorem 14 . 3 1 pro ceeds by the usual Hardy-Littlewood dissection into 
major and minor arcs. 

Definition 4.5. Given a tuple of integers a = (a\, . . . , a^) and q € N, define the 
major arc centred at a/q to be the set 

3%, a) = \\ ai -ai/q\\ < q^X^ (1 < i < iV)}. 

Define the major arcs SUt to be the union of the sets 9JT(g, a) with 1 < q < X 1 / 4 
and a € [q] n subject to (q, a) = 1. Define the minor arcs to be the complement 
m = T N \ JOT. 

One can show that for X S> 1 the major arcs are disjoint. We can therefore 
define the function V(a) to equal V(a; q, a) when a £ £OT(<7, a) C 9K, and equal 
otherwise. 

Lemma 4.6. Whenever s > k(N + 1) + 1 there exists 5 > suc/i t/iai 

f V(c 1 a)---V(c s a)dot=Z6X 2s - K + 0(X 2s - K - 5 ), (4.6) 

w/iere 



3 = 3(c) = 




an<i 

oo 

& = &(c) = J2<l~ 2s E 5(g,cia)-..5(?,c a a). (4.8) 

(?,a)=l 

Proof. Define A(q) to be the sum 

A(q)= £ ^ 2 ^(9,cia)---^, Cs a), (4.9) 
ae[«]* 

(9,a)=l 

and let X(/3; X) denote the product 

Z(/3;X) = J(ci/3;X)---/(c s /3;X). 
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Then by disjointness of the major arcs, and a change of variables = (cti — ai/q)X ki : 
we have 

[ V( Cl a)---V(c s a)da = X- K V A{q) I llAI^,..,^!^;^, 

1< ? <XV4 ^ 

(4.10) 

where = EIi^vHr 1 ^ 174 , q^X 1 ^]. Let ^ = - 1) > 0. By LemmaElO] 

and the AM-GM inequality we have 

N 

l(PlX~ k \. . .,(3 N X~ kN ;X) « X 2s H(l + \pi\y^ +Sl . (4.11) 

i=i 

It follows that 



[ Z{p 1 X- k \...,p N X- k "-X)df3 

JR N \B q 



s-1 



« X2S / *" (1+45l) d* / n , m + 4 , dx 
JxV* \Jr (1 + W) 1+4(51 

Combining this with another change of variables, we have 

/ l(f3 1 X- kl ,...,/3 N X- kN ;X)df3 = ZX 2s + 0(X 2s ~ Sl ). (4.12) 



Set 8 2 = g(§ - N - 1) > 0. By Lemma ET9l 
Thus 

£ \A(q)\« Q-^^X-^. 

q>X 1 /* < 7 >X 1 /4 

It follows that 

£ vl(g) = 6 + 0(X- fe ). (4.13) 

l<9<^ 1/4 

Combining ([4~T0j) . (j4TT2|) and (j4TT3"|) . we obtain the result. □ 

Lemma 4.7. There exist positive integers r and t such that for any s > r + 2Mi 
t/iere exists r > suc/i f/iai 

/ |/(a)| s da < X 2s -^~ T . (4.14) 
./m 

Moreover, one can ensure that 

r + 2tM < kN (log K + log log K + 26). 

Proof. Let us first find a large value for the expression 1 ( T 3 | ^ S occurring in Theorem 
Setting sq = M \k log(21iT)] , by Theorem 11.71 we have 



Therefore 



A S0 < Ke-^ 21K ^/ k < i. 



1-3A SQ l _ 



6s +3 ^ 7s + (7/2) •" 



SOLUTION-FREE SETS 



25 



Since <E> is non-degenerate of degree at least two, Lemma 12.21 guarantees that <3? has 
two linearly independent derivatives of degree one. This implies that N > 3 and 
K > 4. Hence 

k(J = 7s + (7/2) - M\og(21K) - 2.1og(21.4) < I" ( 4 ' 15 ) 

Let a E m and suppose that 

|/( Ci a)| > X 2 -. (4.16) 

Provided X is sufficiently large, it follows from Theorem 13.21 and (|4.15p that there 
exists q E N and integers ai, . . . , ajv, with ^(cjaj) — a«| < X 1 / 8 and q < X 1 / 8 . For 
X 2> c 1 sufficiently large, we have \cj\q < \cj\XV 8 < X 1 / 4 , so a 6 3Jl(cjq,b) C 971, 
a contradiction. Hence we must in fact have 

\f(c 3 o)\<X 2 -°. (4.17) 

Set 

t=\klog(KlogK)1 and r = [(r -1 ^ - */*] , (4.18) 

and let A t M be an admissible exponent for (tM, $). It suffices to prove (|4.14p for 
s = r + 2iM. By (|4.17p . Holder's inequality and Theorem 11.71 we have 

J f{c x oc) ■ ■ ■ f(c sa )da < X 2r -™ j> \f( a )\ 2tM d<x 

^ X 2s ~ K ~ ( rcF -AtM) 

Since ra > AtM> we obtain (|4.14p . 

It remains to show that r + 2tM < kNilogK + log log K + 21). Using the fact 
that k>2, N>3, K>4 and K < min {N 2 , k 3 }, we have 

2tM < t(N + l) 

< k(N + 1) tog (A" log K) + N + 1 

< A;iV(log K + log log K + 4) 



and 



< Ke^^a- 1 + 1 

„ 7A;M log(21 A) + 7M + (7/2) 
- log K + 

< 22kN. 

□ 



Proof of Theorem 473. Let r and f be defined as in the proof of Lemma 14.71 We 
deduce the theorem under the weaker assumption that s > r + 2tM. From this 
assumption, it follows that s = u+2vM, where u > 1 and v > kM (K log(K log K) + 
6). Therefore A v m < e~ 6 < 3/4. Combining this with Theorem 11.71 Lemma 13.111 
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and Holder's inequality, we see that there exists j € [s] such that 

/ (/(cia) • • • f(c a a) - V(cia) ■ ■ ■ V(c s a))da 
Jm 

<^X 2u -i( K j\f(a)\ 2vM da + j \V( Cj a)\ 2vM da 

Using this, together with Lemma 14.61 and Lemma 14,71 we see there exists T2 > 
such that 

j> /(cia) • • • /(c s a)da = 6^ 2s ^ + 0(X 2s - R - T2 ) (4.19) 

It remains to show that 2 = Coo ; that © = JT p a p and that these quantities are 
positive under the appropriate non-singularity hypotheses. To prove 3 = &oo we use 
a method of Schmidt [16J, as described by Parsell [12 1. For a positive real T, define 

K T (P) = ( ^ff^ ) . Kr<jS) = K T (Pi)-KT<J)n). 
It follows from Baker [2j Lemma 14.1] that 



Jr (4.20) 
= Tmax {0, 1 — T\y\} . 

In particular, this Fourier transform is always non-negative. Write 

I(/3) = I(/3; 1) = f e(/3-F( 7 ))d 7 and Z(/3) = I( Cl /3) • ■ ■ I(c s /3). 

i[0,l] 2 

By Fubini's theorem, we have that 

„ s s 

\i T =\ K T (^c t F l { 1 S)---k T (^c t F N { li ))&l 

J[o,i] 2s i=1 ' i=1 

K T (f3)l((3)d/3 

Lemma 13.101 and the AM-GM inequality ensure that, for any e > 0, we have the 
bound 

j(/3)< £ n (i+[Air^ +£ , 

l<i<N 

and a simple estimate reveals that 

1-Kt(P) <min{l,|/3| 2 T- 2 } . (4.21) 

Therefore 

3-/*r« f min{l,|/3| 2 T- 2 } TT (1 + |/3 i |)-^ +£ d/3 
^ l£i<N 

« / i II (l + lftl)~^ +£ d/3+ / 1 \(3\ 2 T- 2 d(3. 

J\f3\>T3N J\/3\<T3N 

Using s > kN, we see that 



3 = lim fir = (Too. 

T-¥oc 



SOLUTION-FREE SETS 



27 



Next, let us suppose that there exists a non-singular real solution to (jl.5|) . Writing 
P(x) for Y^,i=t c «F(x2i-i, X2i), it follows that there is some £ 6 M 2s for which P(£) = 
0, along with 5 C [2s] such that \S\ = N and 

det ( ^(Ol ^ 0. 

The translation-dilation invariance of (j 1 . 5 j) ensures that we may assume that £ € 
(0, l) 2s . Let [2s] \ S = {1{N + 1), . . . , /(2s)}. Define the function p : R 2s -> M 2s by 



A (7) 



Pi(j) if 1 < * < JV, 
v 7, w if iV < i < 2s. 

Let r/ = p(£), so that rji = for 1 < i < N. Notice that 



detp'(7)| 



det (||(7) 



By the Inverse Function Theorem, there exists an open set U C [0, l] 2s which con- 
tains £ and an open set V containing r\ such that p is a homeomorphism from [/ to 
V. Define the constant C\ = Ci($,s) by 



max 
7e[o,ip 



Using positivity of the Fourier transform Kt and the fact that U C [0, l] 2s , we have 

k T {p l { 1 ))---k T {p N { 1 ))& 1 > [ k T {p l ( 1 ))---k T (p N { 1 ))d 1 . 

[0,lp Ju 
This latter integral is in turn bounded below by 
1 



Ci 



#t(pi(t)) ■ ■ ■ k t{pn{i))\ det p' (7)^7. 



v 



By a change of variables this equals 



1 



— / K T (Ci)---^T(Civ)dC. 

W JV 

Since V is defined independently of T, there exists e = e($, s) > such that 
if \Ci — r]i\ < e (1 < i < 2s), then £ G V. Let Wy denote the set of £ for which 
101 < (2T)" 1 (1 < i < N) and \Q - T)i\ < e (i > iV). Then for T > (2s)- 1 , the set 
Wt is contained in V. Moreover, for C, S an d 1 < i < -/V we have Kt(Q) > T/2. 
Therefore 

' K T (Ci) • • • #r(6v)dC > / &r(Ci) • • • ^(Gv)dC 



> meas(Wr) 



2 jv 



> T -N {2£r -,_ 



2s-N- 



-ijV 



2 jv 



>$, S 1. 

Hence //t ^><j> s 1 for all sufficiently large T. 
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Let us now turn to the singular series &. Recalling (|4,9|) . for each prime p define 

oo 
h=0 

By Lemma 13.91 this series is absolutely convergent for s > k(N + 1). 

Let q and r be coprime positive integers. By Euclid's algorithm, any pair x of 
residues modulo qr can be represented uniquely in the form ry + qz with y 6 [q] 2 
and z € [r] 2 . It follows that for a, b E "L N we have S(qr,ra + qh) = S(q,a)S(r,h). 
Again, by Euclid's algorithm, each A-tuple a' of residues modulo qr with (a', qr) = 1 
can be represented uniquely in the form ra + qh with a E [q] N , (a, q) = 1 and 
b E [r]^, (b,r) = 1. A similar argument therefore gives A(qr) = A(q)A(r). 

Let pi, . . . ,p m denote the primes bounded above by X. Using multiplicativity of 
A(q), together with Lemma 13.91 and the fact that | — N = 1 + 2e for some e > 0, 
we have 

oo oo oo 

n^)-6=E- E ^•••p£r)-I>te) 

p<X /li=0 /lm=0 <?=1 

« E 

< E 9 JV+£-s/fe ^0 as A" — y oo. 

By orthogonality 

M(p H )= P~ NH E e(a-P(x)/p ff ) 

a6[p H ] N i=l 

Partitioning the sum over a according to the value of (p , a), we see that M(p H ) is 
equal to 

p H(2s-N)J2A(p h ). 
h=0 

It follows that (3 = rip "?- 

To show positivity of 6, we begin with the following result from elementary linear 
algebra. 

Lemma 4.8. Let h,H be non-negative integers with H > h + 1. Suppose that A 
is an n x n integer matrix with p h \\ det A. Then the image {A • x : x E (Z/p^Z) n | 
contains the subgroup {p h y : y E (Z/jr^Z)™} . 

For a proof of this lemma, let Aij denote the r?'-minor of A, obtained from A by 
deleting the ith row and jth column. We define the adjunct matrix of A by 

adi(A) = ((-l) i+j A ji ) 1 ^ n . 

Then we have the identity 

A • adj(A) = deb(A)I n . (4.22) 
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Since p ||det(>l), we have det(A) = up h where u is a unit in Z/p H Z. Let y G 
(Z/p H Z) n . Then 

p h y = &et(A)(u- l y) 

= A-(u- l adj(A)-y), 

as required. This completes the proof of Lemma 14.81 
Given a subset S C [2s] , define the Jacobian matrix 

JP(X; * )= (!P (X) ) 

\ UJj J J l<i<N,j€S 

When \S\ = N we define Ap(x; S) to be the determinant of Jp(x;S). Given a 
positive integer h, let Bh(p H ) denote the set of x € (Z/p H Z) 2s with P(x) = mod 
p H and for which there exists S C [2s] with \S\ = N and p h \ |Ap(x; S). The following 
claim is a version of Hensel's lemma. 

Claim. For H > 2h + 1 we /taue i/ie bound 

\B h (p H+1 )\>p^- N ^\B h (p H )\. (4.23) 

Fix x G (Z/p H Z) 2s with P(x) = mod p H and 5 C [2s] with \S\ = N and 
p /l ||Ap(x; S). For each j ^ S choose yj G [p] and define 

z . = \ x 3 

J l^+p^ His). 

Let w G Z 2s be subject to the condition that Wj = if j ^ S. By the binomial 
theorem and the fact that 2(H — h) > H + 1, we have 

P(z + p^w) = P(z)+^- /l J P (z;5)-(u; J ) je 5 (mod/ +1 ). (4.24) 

Since P(z) = P(x) = mod p H , we see that p h divides every entry in the iV-tuple 
of integers P(z)/p H ~ h . Hence 

-P(z)/p^ G [p h y : y G (Z/p h+1 Z) N ] . 

Notice that Ap(z;S) = Ap(x;5) modp h+1 , and so p h \ |Ap(z; S). Therefore, by 



Lemma |4.8[ for each j E S we can find wj G Z/p h+1 Z so that 



J P (z; S) • K) i6S = -P(z)/p^ (mod p fe+1 ). 

Moreover, since H — h > h + 1, we have Ap(z + p H ~ h w, S) = Ap(x; S) mod p h+1 . 
Hence 

z+p H " fc w€iB h (p H+1 ). 

Since tOj = if j ^ 5, we see that for each choice of z, the sum z + p H ~ h w gives 
a unique element of Bh{p H+1 )- As there are p 2s ~ N choices for z for each choice of 
x G Bh{p H ), the claim follows. 

Suppose there exists x G Q 2s such that P(x) = and S C [2s] such that the 
Jacobian matrix Jp(x;S) is non-singular over Q p . By homogeneity of the Pj we 
may assume all the entries of x are p-adic integers. Hence there exists a non- 
negative integer h such that |Ap(x; S)\ p = p~ h . Take any y G Z* such that y = x 
mod p 2h+1 . Then p fe | |A P (y; S) and P(y) = mod p 2h+1 , so y G ^(p 2/l+1 ). In 
particular, \Bh(p 2h+1 )\ > 1. Iterating the bound ([4.230 obtained in the previous 
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lemma, we have established that there exists a non-negative integer h = h($,p) 
such that for all H > 2h + 1 we have the lower bound 

|BUp H )l>f (2s_jV)(if ~ 2fe-1) - (4-25) 
Clearly M(p H ) > Bh(p H ), so inputting this into the relation (|4.2|) . we obtain 

T{p) = lim p- H(2s -^M{p H ) 

H-^oo 
> p -(2s-JV)(2fe+l) 

The absolute convergence of the product & = Y\ p T(p), with all T(p) positive, 
implies that 

lim l[T(p) = l. 

X^roo J - J - 
p>X 

In particular, there exists Xq such that for all p > Xq we have 

n t(p) > i/2. 

P>X 

Since T(p) > for all p < Xq, we also have n p <x -^(p) > 0- Therefore 

© = n T ^ n > °- 

p<X p>X Q 

□ 



5. Density bounds for solution-free sets 

This section is dedicated to the proof of our main theorem. 

Theorem 5.1. Let <3? £ 7,[x,y] be a binary form of degree k > 2, differential di- 
mension N and differential degree K , and let c E 7L S be a non-singular choice of 
coefficients for $ with c\ + • • • + c s = . Suppose that s > kN(log -fT+log log K+27). 
Then any set A C [X] 2 containing only diagonal solutions (xi, . . . , x s ) E A s to the 
system of equations 

ci$ M ' v ( Xl ) + • • ■ + c s $ u > v (x s ) = (u + v>0), (5.1) 

satisfies the bound 

\A\ <X 2 (loglogX)- 1/(s - 1) . (5.2) 
Here the implicit constant depends only on c and 

Remark 5.2. We will prove Theorem 15.11 under the assumption that $ is non- 
degenerate. The degenerate case follows from the same argument, but the superior 
bounds available in the standard Vinogradov mean value theorem ensure that, in 
this case, the lower bound on the number of variables required can be decreased. 

In order to prove Theorems 15.11 it is useful to work with translates of sets of the 
form [X] 2 . We define a half-open square to be a subset of M 2 of the form 

Q = x+(0,X] 2 , 

and call X the side-length of Q. Let us write [Q] to denote the set of integer points 
in Q, namely [Q] = Q n 1? . 

We reduce the proof of Theorem 15. II to the following density increment result. 
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Lemma 5.3. Given the assumptions in Theorem \5.1\ there exist absolute constants 
t = r(k), C = C(c, $>) and c = c(c, 3>) > snc/i i/iai for any 5 > and any reaZ 
X > exp(C/5), if Q C M? is a half-open square with side-length X and A C [Q] 
satisfies \A\ = S\[Q]\ and 

RcM A ) < c6 s X 2s ' K , (5.3) 

then there exists a half-open square Qi with side-length at least 2~ k X T , along with 
q € N and r G Z 2 , swc/t i/iai 

|^n (r + g- [Qi])! > ( < 5 + c^ s )|[Q 1 ]|. (5.4) 

Proof that Lemma \5.3\ implies Theorem \5.1[ Let us suppose that A C [X] 2 contains 
only diagonal solutions to (|5.ip and let r, C and c be as in Lemma 15.31 We aim 
to construct a sequence of quadruples (Qi, Ai, Xi, Si) satisfying all of the following 
conditions. 

(i) Qi is a half-open square of side- length Xj. 

(ii) Ai C [Qi] with i4< = (5i | [Qi] | . 

(iii) Ai contains only diagonal solutions to (|5.1|) . 

(iv) X +1 > 2- k X[. 

(v) >^ + «5f. 

Taking Qq = Q, Aq = A, Xq = X and <5o = |Ao|/|[Qo]|, we have our initial quadru- 
ple. Let us suppose we have constructed (Qj,Aj,Xj,8j) for all 1 < j < i. In order to 
apply Lemma [5.31 we must estimate R c ^(Ai). First notice that Af contains exactly 
\Ai\ solutions to (|5.ip with xi = • • • = x s . Any other solution counted by R Ct $(Ai) 
must have all Xj contained on some affine line L, where \L D Aj| > 2. Any 2-set 
{x,y} C Ai is contained in exactly one affine line L. Letting C denote the set of 
affine lines which intersect Ai in at least two places, we therefore have 

RcM A i) < \ A i\ + Yl \ Ln ^ 
Lec 



{i\\ s 



< \Ai\ + ( ) max|Ln [Qi]| 



2 / Lec 
<X l 4 max|Ln[Q i ]| s . 

The set of integer points in L n Qi projects injectively onto either the x or y axis, 
with image equal to a set of integer points contained in a subinterval of length Xj. 
Hence \L n Qi\ < Xi + 1. Thus 

R c M A i) < x° +i . 

Our assumption on the size of s certainly ensures that 2s — K > s + 4, hence taking 
C sufficiently large in the assumption 

Xi > exp(C/<5i), (5.5) 

certainly implies that 

RcMA*) ^ 2 S+3 X/+ 4 < c8lX?— K . 

Assuming ()5.5p . we can therefore employ Lemma 15.31 to obtain a half-open square 
Qi+l of side- length Xj + i > 2~ k XJ , together with q and r such that 

\A D (r + q ■ [Q i+1 })\ > (Si + cSf)\[Q i+1 ]\. 
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Let us set A i+ i = {x G [Qi+i] : r + qx e ^LJ and <5 i+ i = |Aj + i|/|[Qj + i]|. The fact 
that ci + • • • + c s = means the system (|5.ip is translation-dilation invariant. Using 
this, it follows that if (xi,...,x s ) G Af +1 is a solution to (|5.ip . then the tuple 
(r + gxi, . . . , r + qx s ) is a solution to (|5.ip in ^4f. Since ^4, has only diagonal solutions 
to (|5,ip . it follows that (xi,...,x g ) is itself diagonal. Assuming (|5.5p . we have 
therefore obtained another quadruple (Qi+i, ^4j+i, satisfying conditions 

(i)to(v). 

As long as (|5.5p holds, we can iterate this construction. After [c~ 1 <5 1-s ] such 
iterations we have a density Si of size at least 26. After a further |"c~ 1 (2(5) 1 ~ s "| such 
iterations, we have a density of at least 45. Thus, setting L = [log 2 (5 -1 )J, we see 
that after a total of 

(=0 

iterations, we have a density of 2 L+1 5 > 1 (a contradiction). Hence (|5.5p cannot 
hold for all < i < I. Thus for some i G {0, 1, ...,/} we have 

exp(C/<5) > exp(C/6i) > X { 

> XT_ x 2~ k > XC 2 2~ H1+T) > ■ > X(f 2- fe /( 1 ~ r ) (5.6) 

> x Tl 2- k l^ T \ 



Taking logarithms in (|5.6p . we therefore have 



C/6 >t* log X 



Notice that i < I < 2c 1 <y s . So on taking logarithms again we have 

logger 1 + k(l -t)" 1 ) + 2c" 1 5 1 - s log(l/r) > log log X. (5.7) 
Crude estimation shows that the left hand side of (|5,7p is Ofe,*^ 1-5 ), a s required. □ 

We begin the proof of Lemma [5.31 with the following general result on partitioning 
phase polynomials into approximate level sets. 

Lemma 5.4. Let P(x\,X2) denote a real polynomial of degree k. Set 

t- 1 = 24 k (kl) 2 2 k ^ k+1 ^ 2 . 

There exists a positive constant C = C(k), such that for any half-open square Q 
of side-length X, we can find half-open squares Qi, . . . , Q n each with side-length at 
least 2~ k X Tk , along with qi G N and r-j G 1? , such that the sets ri + qi- [Qi] partition 
[Q], and furthermore for any x, y G r, + qi ■ [Qi] we have 

\\P(x)-P(y)\\<CX-*>. (5.8) 

Proof. By Taylor's formula 



P(r + qx) = ^P U ' V (qx) 

u,v>o (5.9J 

= q k F(x) + G(x;r,q), 

where F(x) is a homogeneous real polynomial of degree k, and G(x) = G(x; r, q) is 
a polynomial of degree strictly less than k. Set 

F(x) = ^x\x k -\ (5.10) 

0<Z<fc 
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and let a^ 1 = 6k2 k . It follows from Baker [21 Theorem 8.1] that there exists C\ = 
C\{k) such that for any Y > 1 there is some some 1 < q < Y satisfying 

\\q k m\\ < C x Y- ak (0 < l < k). (5.11) 

Let Y = X 1 / 2 in (|5.1ip . where X is the side-length of Q. Partitioning [Q] into 
congruence classes modulo q, we have 

[Q}= |J r + g-[Q(r)], 

re[g] 2 

where Q(r) is a half-open square of side-length X/q. Let us set 

Then we can partition each Q(r) into t 2 half-open squares Q(r,t) (t G [i] 2 ), each 
of side-length X/(qt). For fixed r and t let us pick a(r, t) G [Q(r, t)]. Then we 
have [Q(r, t)] = a(r, t) + [Q'(r, t)], where Q'(r, t) is a half-open square of side-length 
X/(qt) satisfying 

Q'(r,t) C [-X/(^),X/(gt)] 2 . (5.12) 

It follows that there exist pairs b(r,t) E Z 2 (r £ [g] 2 , t G [T]) such that the set [Q] 
is equal to the disjoint union 

U |J (b(r,t) + g-[Q'(r,t)]). 

rG[q] 2 te[T] 



Clearly X/(qt) < X a *l^ k \ Since q < X 1 / 2 < X 1 -'^ 4 ®, we also have 

X 

Hence the side-length of each Q'(r,t) is between \X CTk / { - Ak ^ and X CTfc /( 4fc ). It follows 
that for any x, y G [Q'(r, t)] we have 

||g fc (F(x)-F(y))|| « fc X^. (5.13) 

Write G Fi t(x) for the polynomial G(x; b(r, t), q). By induction, there exists a 
partition of [Q'(r, t)] into sets of the form Sj + g^ ■ [Q£(r, t)] (1 < i < m = m(r, t)), 
where Q^(r, t) is a half-open square of side- length at least 

2 1 - k - T k _ 1 J^CTfc T fc _ 1 / (4fc) 

and such that for any x,y G [Qj(r, t)] we have 

||G ri t(s i + g i x)-G rjt (s i + g i y)|| « fc X-^-i/( 4fc ). (5.14) 

Let us write <^(r, t) for ggj(r,t) and b^(r,t) for b(r,t) + gsj(r,t). Then [Q] is 
partitioned by the sets 

bJ(r,t) + ^(r,t).[Q' i (r } t)] (r G [q] 2 , t G [T], l<i<m(r,t)). 

Combining (j5.9j) . (|5.13p and (j5. 14|) . we see that for each x, y G [Qj(r,t)] we have, 
on writing b' = b^(r,t) and q' = g^(r, t), that 

||P(b' + g'x) - P(W + g'y)|| < ^-^-i/C**). 

A simple calculation reveals that akTk-H '(4fe) = Tk, as required. □ 
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Proof of Lemma \5.cH Let us define 

/ A (a) = ^l A (x)e(a-F(x)), 

X 

together with /(a) = /[qi(q!) and <m(c>0 = Ia((^) — Sf (<*■)■ Using translation 
invariance of (|5.1j) . we have i? Ci $([Q]) ~ i? C) $(X). Let ci equal the quantity 
o~oo(c)Y\ p o~p(c) from Theorem 14.31 Provided we take C in Lemma 15.31 sufficiently 
large, so that X > exp(C/5 _1 ) 3> c ,$ 1, we can use Theorem 14.31 to ensure that 



RcMiQ]) > ha* 



2s-K 



(5.15) 



The assumption that c is a non-singular choice for $ implies that c\ is positive. Let 
us take c in Lemma [5.31 sufficiently small, say c < \c\. Combining (|5,3p . (|5.15p . 
orthogonality and Holder's inequality, we have 

\ Cl 8 s X 2s - K < \R C ^(A) -5 S R C MIQ})\ 



< j> \ f A {c\cx) ■ ■ ■ f A (c s a) - 5 s f(cia) ■ ■ ■ f(c s a)\da 

< sup \g A (a)\X 2e l{\f A (cx)\ 2t + \f( a )\ 2t )da, 

Oi J 



where s = 1 + e + 2t, for some e G {0, 1}. Since 2t > kN (log K + log log K + 26), 
we can use Theorem 14.31 (together with the underlying Diophantine equation) , to 
conclude that 

j>[f A {oc)] 2t dot< j>\f{cc)\ 2t dcx 
« C)# X^~ K . 

Setting b A (x) = 1 A (x) — 51\q](x), we see that there exists a G such that 

\9aM\ > c ,* S s X 2 . 



| Y, ^(x)e(a • F(x)) 



(5.16) 



Let r = Tfc and Ci = Ci(fe) be as in Lemma 15.41 and consider the polynomial 
P(x) = a ■ F(x). Then there exist half-open squares Qi, . . . , Q n each of side-length 
at least 2~ k X T , along with r-j and qi (1 < i < n) such that the sets r i + qi ■ [Qi] 
partition [Q], and for any x, y G r» + qi ■ [Qi] we have ||-P(x) — P(y)|| < C\X~ T . 
Notice that this implies that |e(P(x) - e(P(y))| < X~ T . Thus 



53 &A(x)e(a ■ F(x)) < £ £ ^(x)e(P(x)) 

xG[X] 2 i=l xgr 4 + gi -[Q»] 



= E E ^x) +o(i-). 

i=l xer 4 +iJi-[Qi] 

We can take C in Lemma 15.31 sufficiently large to ensure that the lower bound 
X > exp(C/<5) implies that the 0(X 2_T ) term above is at most half the size of the 
right hand side of (|5.16|) . We thereby obtain that 



E E Mx) 

i=l xeri+qi-[Qi] 



s v 2 



» c $ 5 s X 



(5.17) 
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Let X denote the set of % G [n] for which ^ xej .. +gi .rg.i b^(x) > 0. Since bA has 
average zero, we can add ^ x ^( x ) t° the left side of (|5.17|) . to obtain 

J2( E ^(x))»c,$5 s X 2 . (5.18) 

The density increment (|5.4|) now follows from the pigeon-hole principle, provided we 
take c = c(c, <3?) sufficiently small. □ 

Our originally advertised theorem, Theorem 11.31 now almost follows. It remains 
to show that kN(\ogK + log log if + 27) < ffc 3 log k(l + o(l)). We have the trivial 
bound N < k 2 and if < fciV < A; 3 . The bound iV < ^(1 + o(l)) takes a little 
more calculation, but follows from the fact that the number of linearly independent 
derivatives & u ' v with u + v = k — disat most max {k + 1 — d, d + 1}. 

Acknowledgements. The author would like to express his gratitude to Professor Woo- 
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